ME R-CNN: Multi-Expert R-CNN for Object Detection
نویسندگان
چکیده
Recent CNN-based object detection methods have drastically improved their performances but still use a single classifier as opposed to ”multiple experts” in categorizing objects. The main motivation of introducing multi-experts is twofold: i) to allow different experts to specialize in different fundamental object shape priors and ii) to better capture the appearance variations caused by different poses and viewing angles. The proposed approach, referred to as multi-expert Region-based CNN (ME R-CNN), consists of three experts each responsible for objects with particular shapes: horizontally elongated, square-like, and vertically elongated. Each expert is a network with multiple fully connected layers and all the experts are preceded by a shared network which consists of multiple convolutional layers. On top of using selective search which provides a compact, yet effective set of region of interests (RoIs) for object detection, we augmented the set by also employing the exhaustive search for training. Incorporating the exhaustive search can provide complementary advantages: i) it captures the multitude of neighboring RoIs missed by the selective search, and thus ii) provide significantly larger amount of training examples to achieve the enhanced accuracy.
منابع مشابه
Face R-CNN
Faster R-CNN is one of the most representative and successful methods for object detection, and has been becoming increasingly popular in various objection detection applications. In this report, we propose a robust deep face detection approach based on Faster R-CNN. In our approach, we exploit several new techniques including new multi-task loss function design, online hard example mining, and...
متن کاملLearning Oriented Region-based Convolutional Neural Networks for Building Detection in Satellite Remote Sensing Images
The automated building detection in aerial images is a fundamental problem encountered in aerial and satellite images analysis. Recently, thanks to the advances in feature descriptions, Region-based CNN model (R-CNN) for object detection is receiving an increasing attention. Despite the excellent performance in object detection, it is problematic to directly leverage the features of R-CNN model...
متن کاملMulti-Channel CNN-based Object Detection for Enhanced Situation Awareness
Object Detection is critical for automatic military operations. However, the performance of current object detection algorithms is deficient in terms of the requirements in military scenarios. This is mainly because the object presence is hard to detect due to the indistinguishable appearance and dramatic changes of object’s size which is determined by the distance to the detection sensors. Rec...
متن کاملMulti-region Two-Stream R-CNN for Action Detection
Motivation: I Previous work shows improvement with better proposal methods [1] I State-of-the-art CNN based action classi cation relies on multi-frame optical ow [2] I Object recognition is improved by multiple-region feature [3] Contribution: I We introduce a motion Region Proposal Network (RPN) I We show that multi-frame optical ow signi cantly improves action detection I We embed a multi-reg...
متن کاملObject Detection in Video using Faster R-CNN
Convolutional neural networks (CNN) currently dominate the computer vision landscape. Recently, a CNN based model, Faster R-CNN [1], achieved stateof-the-art performance at object detection on the PASCAL VOC 2007 and 2012 datasets. It combines region proposal generation with object detection on a single frame in less than 200ms. We apply the Faster R-CNN model to video clips from the ImageNet 2...
متن کامل